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We establish the large deviation principle for stochastic differential equations with av- 
eraging in the case when all coefficients of the fast component depend on the slow one, 
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including diffusion. 



1 Introduction 

This is a corrected version of the paper Veretennikov (1999). We consider the 
SDE system 

dX t = f(X t ,Y t )dt, X = x , 
dY t = s- 2 B(X t , Y t )dt + e-'CiXt, Y t )dW t , Y = y . (1) 

Here X t e E , Y t e M, M is a compact manifold of dimension I (e.g. torus 
T e ), f is a function with values in <i-dimensional Euclidean space E , B is a 
function with values in TM, C is a function with values in (TM) e (i.e., in local 
coordinates an I x I matrix), Wt is an ^-dimensional Wiener process on some 
probability space (Q,F,P), e > is a small parameter. Concerning SDE's on 
manifolds we refer to Watanabe and Ikeda (1989). 

The large deviation principle (LDP) for such systems with a "full depen- 
dence", that is, C(X t ,Y t ), was not treated earlier. Only the case C(Y t ) was 
considered in papers by Freidlin (1976), Freidlin (1978), Freidlin and Wentzell 
(1984) for a compact state space and by Veretennikov (1994) for a non-compact 
one. There well, recent papers on more general systems with small ad- 

ditive diffusions by Liptser and by the author which also only concern the case 
C{Y t ). 

The LDP for systems like (0) is important in averaging and homogenization, 
in the KPP equation theory, for stochastic approximation algorithms with 
averaging and so forth. The problem of an LDP for the case C(X t ,Y t ) has 
arisen since Freidlin (1976), Freidlin (1978). Intuitively, the scheme used for 
C(Y t ) should work; at least, almost all main steps go well. Indeed, there was 
only one lacuna; the use of Girsanov's transformation did not allow freezing of 
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X t if C depended on the slow motion while it worked well and very naturally 
for the drift B(X t ,Y t ). Yet the problem remained unresolved for years and 
the answer was not clear at all. Notice that this difficulty does not appear 
in analogous discrete-time systems [see Gulinsky and Veretennikov (1993), 
Chapter 11]. 

It turned out that the use of Girsanov's transformation in some sense pre- 
vented resolving the problem. Our approach in this paper is based on a new 
technical lemma, Lemma El below. The main idea is to use two different scales 
of partitions of the interval [0, T], a "first-order partition" by points A, 2A, . . ., 
which do not depend on the small parameter e and "second-order partitions" 
which depend on e in a special way, by points e 2 t(e), 2e 2 t(e), .... Then the 
exponential estimates needed for the proof of the result can be established by 
two steps. First, the estimates for a "small" partition interval are derived using 
the uniform bound of Lemma El (see below) and the estimates for stochastic 
integrals. It is important that in the "second" scale the fast motion is still close 
enough to its frozen version [the bound (fT5|) below]. Second, the bounds for 
"small" partitions and induction give one the estimate for a "large" partition 
interval. 

The original proof in the paper 1999 contains a gap. It relates to a bound- 
edness of some auxiliary constant h in the proof, - in the original version this 
constant may depend implicitly on the partition size A ; hence generating a 
vicious circle. The main aim of this version of the paper is to present the 
"patch" . The correction uses improved approximations that keep this constant 
b bounded in the lower and upper bounds, and it uses also a truncated Legendre 
transformation in the upper bound. The author is deeply indebted to Professor 
Yuri Kifer for discovering this vicious circle in the original version of the pa- 
per. The main technical tool remains the Lemma\^ All standing assumptions 
are the same as in the original version. 

The main result is stated in Section 2. In Section 3 we expose auxiliary 
lemmas, among them the main technical Lemma |S] with its proof and a version 
of an important lemma from Freidlin and Wentzell (1984) (see LemmaEJ) which 
requires certain comments. Those comments along with other related remarks 
are given in the Appendix, the latter has been also slightly extended. The 
proof of the main theorem is presented in Section 4. 

2 Main result 

We make the following assumptions. 

(Af) The function / is bounded and satisfies the Lipschitz condition. 

(Aq) The function CC* is bounded, uniformly nondegenerate, C satisfies the 
Lipschitz condition. 

(Ab) The function B is bounded and satisfies the Lipschitz condition. 

Some conditions may be relaxed; for example, B can be locally bounded, 
C locally (w.r.t. x) nondegenerate and so on. 
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The family of processes X s satisfies a large deviation principle in the space 
C([0, T]; R d ) with a normalizing coefficient e~ 2 and a rate function S(ip) if 
three conditions are satisfied: 

\imsnpe 2 \ogP x (X £ e F) < -MS(if), VF closed , (2) 
lim inf e 2 log P x (X s e G) > - inf S(<p) , VG open , (3) 

e— >0 G 

and 5 is a "good" rate function; that is, for any s > 0, the set 

$( S ):=(^C([0,T];iJ d ): %)<s, ¥>(0) = x) 

is compact in C([0,T]; _R d ). In fact, we will establish the following equivalent 
set of assertions due to Freidlin and Wentzell: 

lim sup lim sups 2 log P x (p ^(^ > 5) < -s, Vs > 0, (4) 

where $(s) := G C[0,T; S(<p) < s}, and 

lim inf liminfe 2 logP a .(p ,r(A' e ,<p) < 8) > S(<p), Vy?, (5) 

5 — >0 £ — >0 

where S is a "good" rate function (see above). 

Let W t = £~ 1 W / " te 2, y t = Y t£ 2, x t = X t£ 2 and let y* denote a solution of SDE, 

dy* = B(x, y x t )dt + C(x, y*)dW t , y x = y . (6) 

Theorem 1 Let (Af), (A B ), A c ) be satisfied. Then the family (Xf = X t , < 
t < T) satisfies the LDP as e — > in the space C([0, T]; R d ) with an action 
function 



S(<p) = / L((p t ,(p t )dt, 
Jo 



where 

L(x, a) = sup(a(3 — H(x, /?)), 


H(x, (5) = lim t~ l logFexp ( / f(x,y x )ds 

The limit H exists and is finite for any (3, the functions H and L are convex 
in their last arguments f3 and a correspondingly, L > and H is continuously 
differentiable in (3. 

The differentiability of H at any (3 is provided by the compactness of the 
state space of the fast component. 
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3 Auxiliary lemmas 

Let us consider the semigroup of operators Tf , t > 0, on C(M) defined by the 
formula 



where (3 G E d ) (3f is a scalar product. 

Lemma 1 Let assumptions (Af), (As), (Ac) be satisfied. Then for any f3 the 
operator Tf is compact in the space C([0, T]; R d ). 

Lemma 2 Lei assumptions (Af), (As), (Ac) be satisfied. Then the spectral 
radius r(Tf) is a simple eigenvalue of Tf separated from the rest of the spec- 
trum and its eig en- function ep belongs to the cone C + (M). Moreover, function 
r(Tf) is smooth (of C°°) in (3 and the function ep is bounded and separated 
away from zero uniformly in \(5\ < b and any x',x. 

Lemma 3 Let (3 G E d , and let assumptions (Af), (Ab), (Ac) be satisfied. 
Then there exists a limit 



moreover, H(x',x,(3) = log r(Tf ' x ' 13 ). The function H(x',x,(3) is of C°° in (3 
and convex in f3. For any b > there exists C(b) such that, for any y, \/3\ < b 



Notice that \H(x',x,p)\ < \\f\\c\P\- 

Lemma 4 Let assumptions (Af), (A B ), (A c ) be satisfied. Then for any b > 
the functions H and V ' @H are uniformly continuous in (x',x,j3), \/3\ < b. 

Lemmas 1-4 are standard [cf. Veretennikov (1994) or (1992)]. They are 
based on Frobenius-type theorems for positive compact operators [see Kras- 
nosel'skii, Lifshitz and Sobolev (1989)] and the theory of perturbations of linear 
operators [see Kato (1976), Chapter 2]. Denote F t = F t£ 2. 

Lemma 5 Let assumptions (Af), (A B ), (A c ), b > 0, t(e) — > oo and t(e) = 
o(log£ _1 ) as e — > 0. Then for any v > there exist 5(u) > 0, e(u) > 
such that for e < e(u) uniformly w.r.t. t , x', x, x , y , x to = x , \/3\ < b, the 
inequality holds on the set {\x to — x\ < 5(u)}, 







to+t(e) 




f(x',y s )ds)\F t0 )-t(e)H(x',x,(3) < ut(e). 



(8) 



to 
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Moreover, if A < A(v) = (1 + ||/||c) 1( ^( z/ )/2 and e is small enough, then 
uniformly w.r.t. to, x' , x, xq, yo, 5 < b~{v), \xq — x\ < 5, and |/3| < b, 



exp(e- 2 AH(x',x,/3) - vAe 



-2> 



t +A 



<E [exp(Pe- 2 f f(x',Y s )ds)\F t0 
< exp(£- 2 AH(x',x,(3) + uAe' 2 ) 



(9) 



Remark. We reiterate and emphasize that any couple (A, 5) satisfying only 
A < A(z/) and 5 < 8{p) would do. 

Proof. Step 1. It is sufficient to prove (jSJ) and for t = 0. Moreover, since 
H is continuous, it suffices to check both inequalities for x = xq. Indeed, the 
bound 



( 7 \ 

logEexp /3 I f(x ,y s )ds - t(e)H(x ,x , P) 



< vt(e) 



implies 



logEexp 1/3 / f(x', y s )ds\ - t(e)H(x', x, P) 

<t(e)(u+\H(x',x,P)-H(x',x ,P)\), 

and we use the uniform continuity of the function H on compact sets (remind 
that |/3 1 < b). The same arguments are applicable to the second inequality of 
the assertion of the lemma. So, in the sequel we consider the case xq = x. 
Let us show first that 



sup 

X 1 ,XQ 



^-MogEexp (p f{x',y s )ds\ -H(x',x,P) 



< v (10) 



if e is small enough. Due to Lemma El it would be correct if y s were replaced 
by y* and t(e) > v~ l C(b). We will also use the bounds 

sup \x s - x \ < \\f\\ C £ 2 t VC, exp(Ct(e))t(e) 2 e 2 -»• 0, e -> 0. (11) 

0<s<t 



Let \f(x',y) - f(x',y')\ < L f \y-y'\ for all y,y',x', L f > 0, C f = \\f\\ c . We 
estimate for t(e) > v~ l C{b)/A, 



Eexp ^P J f(x',y s )ds 
] + / f sup \y t -yf \ > v/{ALfb) J I 

J \o<t<m j j 

< Eexp (p f f(x', y:)ds + t(e)v/±\ I ( sup \y t - y*\ < u/(4L f b)\ 



sup \y t -y*\<v/{AL f b) 

0<t<t(e) 



) 



+ exp(Cfbt(e)v)EI [ sup \y t - yf\ > v/{AL f b) 

y O<t<t{s) J 
tie) 

< Eexp | (3 [ f(x',y x s )ds I exp(t(e)z//4) 



+ exp (C f bt(e)u) v~ 2 E sup \y t - yf\ 2 .(12) 

t<t(e) 



By virtue of Lemma El we have 

t(e) 



Eexp \p J f(x',y* s )dsj < ex V (t(e)(H(x', x, (3) + u/i)) 

if e is small enough. A similar lower bound holds true also. 

Let us estimate the second term. By virtue of the inequalities for the Ito 
and Lebesgue integrals, we have 



£sup \y t > - y x t , 
t'<t 



x\2 



t 

x\\\2. 



<CE J \C(x s ,y s )-C(x } y:))\ 2 ds 
o 

+CtE J \B(x a ,y 8 ) - B(x,y x s ))\ 2 ds 
o 

t t 
<C E\x s - x\ 2 ds + C \ E sup \y s - y x s \ 2 ds 

J J u<s 



f 

< Ct 2 e 2 + C [ Esup \y u - y x u \ 2 ds. 

J u<s 


By virtue of Gronwall's lemma, one gets 

£sup \y v - y x ,\ 2 < Ct 2 e 2 exp(Ct). 
t'<t 

In particular, 

E sup \y v -y x ,\ 2 < Ct{e) 2 e 2 exp{Ct{e)). (13) 

t'<t(e) 

So the second term in ()12|) does not exceed the value exp(Cfbt(e)h , )i , ~ 2 Ct(e) 2 e 2 
which is o(exp(Kt(e))) for any K > 0. Indeed, exp(t(e)(Cfbi/ — 
K))h>~ 2 Ct(e) 2 e 2 — > due to the assumption t(e) = o(log£: _1 ), e — > 0. This 
proves (fTUjl. 

Notice that the bound (|10|) is uniform w.r.t. < 6 and a;', x, yo- Since 
the function is continuous, we get on the set {\x to — x\ < 5(u)}, 
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sup 

x',x,y ,t ,\l3\<b 



t +t(e) 



logE exp \p / f{x',y*)ds \\F to \ - t(e)H(x', x, 0) 



to 



if 8{y) is small enough. 
Step 2. Let A < (1 + 



c 



-H{y)/2 = A(u) and N = Ae- 2 t{e)- x . Then 



sup < s <jvt( £ ) \x s -x \ < S(u)/2. Let |x-x | < S(u)/2. So, supo^^) \x a —x\ < 
5(v). In particular, \xkt( £ ) — x\ < 5{u) for any 1 < k < N. By induction, we 
get from (fTlj) for such k, 



exp(kt(e)H(x' , x, (3) — ukt(e)) 

(kt(e) 

< exp(kt(e)H(x, x, 0) + iskt(e)), 
or, after the time change, 

exp(kt(e)H(x',x, (3) — vkt{e)) 

( kt(s)e- 2 \ 



< £"exp 



V 



f(x',Y s )ds 



J 



< exp(kt(e)H(x', x, (3) + ukt(e)). 
Since H is continuous then we obtain for k = N, 

exp(e- 2 AH(x',x,(3) - vAe' 2 ) 



<Eexp (^e- 2 J f(x',Y s )d)j 
< exp(e~ 2 AH(x',x ,P) + vAe~ 2 ). 



(15) 



Lemma is proved. QED 
The next Lemma is an improved version of the Lemma 7.5.2 from Freidlin 
and Wentzell. Although we will not use it, its technique is essential. 

Lemma 6 [Freidlin (1978), Freidlin and Wentzell (1984)]- Let S(ip) < oo. 
If ip n is a sequence of step functions tending uniformly to if in C[0, T]; R d ) as 
n — > oo ; then there exists a sequence of piecewise linear functions \ n (with the 
same partitions) which also tend uniformly to if and such that 



hmsup f L{r s ,X n s )ds<S{ V ). 
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Moreover, one may assume without loss of generality that for any s there exists 
a value 

(3 S = argmax(/%: + - # (C C P)) 

and 

//■?/' is c/ose enough to ip™ then there exists a value 

(3 

L(C, if, a) > if, *?+) + (a - A Va ^ 

and 

L(Ci,x;)^L(C,ftx: + ), 

We added to the original assertion the property that Xt ma y be chosen 
piecewise linear. Indeed, such functions are used in the proof; see Freidlin and 
Wentzell (1984), Section 7.5. The existence of (3 S asserted in the lemma also 
follows from the proof; see Freidlin and Wentzell (1984) or Freidlin (1978). 
Assertions about %[) and j3 s also added to the original assertion can be deduced 
from the proof using similar arguments. 

In fact, there is a little gap in the original proof, namely, an additional 
assumption was used which was not formulated explicitly. This is why we 
have to present a precise statement and give necessary comments on it in the 
Appendix. 

4 Proof of theorem 1 

l. First part of the proof: the lower bound. Let S(tp) < oo, and v > 0. 
To establish the lower bound, we will show the inequality: given any 
v > 0, and any 5 > 0, we have for e > small enough, 

£ 2 logP,(p ,r(X £ ,y9) < 5) > -S(<p) - v. 

Denote H(x,/3) = H(x,x,[3). The existence of the limit H(x,x',-) for 
any x, x', and its differentiability and continuity are asserted in Lemmas 
3 and 4. Throughout the proof, we may and will assume that for any 
s, L(ip s ,ip s ) < oo. Indeed, this may be violated only on a set of s 
of Lebesgue measure zero. Notice that due to the boundedness of the 
function /, this inequality implies sup s \ip s \ < \\f\\c- Indeed, for any 
\ a \ > \\f\\c, we have L(x,a) = +oo. 

In the sequel, both X Q = x and Y — y are fixed, hence, the probability 
symbol P will be used without indices. 
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2. We are going to reduce the problem of estimation from below the prob- 
ability 

P(p(X, <p) < 5) 

to that for the probability 

P{p(X^,p) <8'), where Xf := x + [ f(4> s , Y s )ds, W>, 

Jo 

and further to 

P(p(X^ x )<6'), 

where both ip, x approximate ip. The rough idea is eventually to choose 
a step function as ip and piecewise linear one as x, however we are going 
to perform these approximations gradually. A step function is needed 
because we only have a technical tool - the Lemma 5 - established for 
this very case. A piecewise linear ip is not necessary, but convenient. 
Eventually we consider a finite-dimensional subset of the set {p(X, p) < 
5}, of the form (slightly abusing notations which will be explained in the 
sequel) 

{p(Xt xa) < S[,p(Xf A , X 2A) <5' 2 ,..., p(Xt, X t) < S't/a}, 

with appropriately chosen A, X^, deterministic curves ip, x, and con- 
stants 5' k : in particular, we will choose 5[ « 5' 2 « . . . « o~' T / A << 5. 

While performing all these approximations, we need to establish simulta- 
neously a special property: at any point s, the Fenchel-Legendre adjoint 
to the Xs variable (5 S = /3 s [ip s ,Xs] (see below) can be chosen uniformly 
bounded. 

There will be several, - actually, a lot of, - small constants chosen con- 
sequently throughout this proof. In the beginning, 5 > and v > are 
fixed; due to the Lemma 5, we have also A(z/) and Here we prompt 
in advance the order of the choice for most of them (by this diagram we 
do not claim that every following constant only depends on the previous 
one, just the order): 

b & 5' I— > 5' I— > A & 5" I— > 5' m I— > i— > z m -\ i— > v ' m _ x . . . i— > 5[ \— > z± \— > v [. 

Remark. Emphasize that the final set (above) will be a subset of the 
{p(X,ip) < 5}, however, it is not necessary that 

{p(v ? A, Xa) < 5[, p(y?2A, X2A) < 6' 2 , . . . , p(ip T , Xt) < 5' T/A }, 
that is, the curve ip itself does not have to belong to this subset. 

3. For any nonrandom curve ip, - although we will apply this firstly to (p, 
but other functions are also necessary for the analysis below, - we have, 
due to the Lipschitz condition on /, 

{p(X, tp) < 6} D {p{X^ x) < 5'} (16) 
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if 5' and A := po.tOPjVO are small enough with respect to 5. (A small 
constant A > is used just within this step.) E.g., 5' < 5(e CT CT + 
l) -1 /2, A < 5{e CT CT + l)- x /2 suffice, see below. Indeed, 

X t = x+ [ f(X s ,Y s )ds, Xf = x+ [ f(i/>„Y a )ds, 
Jo Jo 

thence, 

\X t -Xf\< [ \f(X„Y a )ds-f{i/> s ,Y,)\ds<cf \X s -^ s \ds 
Jo Jo 

<C [ \X S -Xf\ds + C [ \Xf- X s\ds + C [ \xs~A\ds; 
Jo Jo Jo 

so on the set {p(X^, \) < $'}, 

\x t - x}\ <C [ \X S - Xf\ds + CS't + cxt, 
Jo 

and, moreover (on the same set), 

sup \X V - Xf, \ <C I sup \X S , - Xf,\ds + C{5' + X)t, 

0<t'<t Jo 0<s'<s 

which implies by Gronwall's inequality that on the same set, 

p(X,X^) < e CT C(5' + A)T. 

Now, 

p(X, cp) < p(X, X*) + p{X*, x) + p{x, <p) 

< e CT C(5' + A)T + 5' + A = (5' + X)(e CT CT + 1). 

Therefore, (fTBj) holds true. E.g., 

5' < 5(e CT CT + l)- x /2, A < 5(e CT CT + l)~ l /2 

suffice. In particular, it is true that 

{p(X }i f)<5}D{p(X^ x )<S / } 1 

if 5' and A are small enough with respect to 5. This bound will be used 
while establishing a lower bound. 

4. While establishing an upper bound, an opposite inclusion will be useful, 
{p{X, <p) < 5} C {p(A^, x ) < 25{KT + 1)}, (17) 
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if A = max (p((p, ip), p(p, x)) < 3. Indeed, 

\X t -Xf\< f \f(X s ,Y s )ds-f(^ Sj Y s )\ds<K[ \X s -ip s \ds 
Jo Jo 

<K \X s -p s \ds + K \tp s - p s \ds; 
Jo Jo 

so on the set {p(X, p) < 5}, 

\X t -Xf\ < K5t + KXt, 
and, moreover (on the same set), 

p{X,X^) < K{5 + X)T. 
Now, (|17j) follows from the inequalities, 

p(X*, x) < P(X, X+) + p(X, p) + p( X , P) 

< K(5 + A)T + 5 + A. 
5. Our next goal is the choice of an appropriate x = <fi b ] essential is to 

T 

keep the integral f L(p s ,p b s ) ds close to S(ip). Suppose for some s G 
o 

[0,T], the set {a : L(p s ,a) < oo} has a non-empty interior, - for the 
latter we will use a notation C° [/, p s ] , - with respect to its linear hull 
C[f,p s }. Since L(p s ,p s ) < oo, this value is attained as a liminf of 
the values L(<p s ,a), a G C°[f,p s ], as a — > p, see Rockafellar (1970). 
It is a property of any such a that there exists a finite adjoint vector 
(3 = argmax /3 (a/3 — H(<p s ,/3)) given a, although this adjoint may be not 
unique which we will discuss shortly. Notice that, in particular, we have 

H(ip s , (3) = {af3 - L(p s , a)), and L(p s , a) = (ap - H(p s , (3)). 

Then we choose a vector p s := a G C°[f, p s ] so that the value L(p s , p s ) 
is close enough to L(p s , p s ). 

The adjoint /3-value is unique iff L is different iable at a, which is also 
equivalent to strict convexity of H at f3 (see Rockafellar (1970)). If this 
is not a case, that is, L is non-differentiable at a, then there is a sub- 
gradient to the graph of L at a which is a non-trivial cone (although 
its dimension may be less than d). In this case one can choose various 
adjoint vectors /3's. Although not unique, both <p s and P[p s ,p s ] can be 
chosen as Borel functions of s (due to the Measurable Choice Theorem). 
Denote thus chosen adjoint by (3[p S) p s }. 

If the set C°[f,(p s ] is empty, one can choose f3[p s ,p s ] = 0, see Ap- 
pendix A. 
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Now set 



(fit := x + / ip s ds 



f 

Jo 



We can choose this new curve to be as close to (p as we like, and the 
values f Q L((p s , (p s ) ds and J Q L(ip s , <p s ) ds are arbitrarily close to each 
other, too, say, 



T 



L(ip s ,<p s )ds - S(ip) 



< u/3. 



6. For any s, let us choose a (measurable) vector a s such that L((p S: a s ) = 0. 

Notice that L((p s , •) > since H(tp s , 0) = 0; moreover, H(tp s , •) is convex 
at the origin, hence, there does exist a such that L(ip s , a) = 0. If not 
unique, this vector can be still chosen measurable due to the Measurable 
Choice Theorem. Moreover, \a s \ < \\f\\c for an y vector where L is finite. 

It is shown in the Appendix C that a s G C° [/, <p s ] , if the latter set is 
not empty. A corresponding adjoint vector j3 (i.e. (3 = argsup [(• ,a s ) — 
H((f s , •)]) does exist, and may be chosen as a zero- vector in R d that we 
denote by 0. This vector is, indeed, one of (if not unique) adjoint vectors 
for the vector a s , since H(ip s , 0) = 0, and 0a s — H(ip s , 0) = 0. If unique 
(that depends on the differentiability of the function L, or, equivalently, 
on strict convexity of the function H), this adjoint is precisely 0. 

7. Next, given b, define 
{$8l(\P[<P8,$s]\ < b)+& 8 l(\P[(P3,0s]\ > b)) ds. 



x + 



Since |a s | < we can find such a b that the curve (f b is still as close 

to (p as we like in the uniform norm, and the values j Q L(cp s: (p b s ) ds and 
L(ip s , ips) ds are arbitrarily close to each other. 

At the same time, if for some s we have \(3[ip s , <fi s ]\ > b, then the /5-value 
is transformed into /3[(p s ,(p b s ] = (3[ip s ,a s ] = 0, so that for any s e [0,T] 
the inequality holds true, 



\P[<P.,rt\\<b. 



We have, 



{p(X : <p)<5}D{p(X,<p b )<5/2}, 



if b is large enough. 

8. Next, the values of rate functions Sf(<p) and S" p ((/9 b ) are arbitrarily close, 
say, \Sf(<p) - Sf(<p b )\ <u/3, so that 



\S*(<p b )-S(tp)\<2v/3, 
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if b is large enough. Moreover, in addition, 



{p(X,cp b )<S/2}D{p(X^cp b )<~S'}, (18) 



if 5' and A = p(ip, tp b ) are small enough with respect to 5; hence, we can 
fix the values b and 5' here. 

9. The next transform is the change of both (p and (p b so that the first 
becomes a step function, while the second becomes piecewise linear on 
[0, T]. In the meantime, all adjoint /3-values will remain bounded. 

Consider the approximations 

■0s = V- , Km(s+a)-a ? Xs = ^ ' K m {s+a)-ai 

where m = T/A is a positive integer, K m (s) := [s/A]A, and a is 'almost 
any' value from [0, T]; we assume ip s = <p , s < 0, and ip b s = ip b , s < 0. 
Since sup 4 \ip t \ < oo, we have p(ip,ip) — > 0, m — > oo. Next, it is well- 
known 3 that there exists such a subsequence m — > oo (we keep notation 
m for this subsequence) that almost surely w.r.t. a, the values of the two 
integrals are close to zero, 

T nT 

\<Ps-Xs\ds+ \L((p s ,(p b s ) - L(ip a ,x,)\ ds m 0. (19) 
o Jo 



3 Possibly this folklore fact can be found in N. V. Krylov's Lectures on Stochastic Processes 
(Moscow State University, 1980s, Russian, vol. 2; the English translation of this book has 
been published recently by the AMS). For the reader's convenience we provide a folklore 
proof here. Indeed, due to a standard technique, it suffices to show for the first integral that 

/ / \ ( P b Km (s+a)-a- ( P b s\ dads ->Q, m^OO. 

Jo Jo 

Let g s , < s < T, be a smooth bounded function such that J \ip b s — g s \ds < v; all functions 
are extended to — oo < s < oo so that all vanish outside [0,T]. Then 



J da J ' \<P b Km (8+a)-a- 9 Km (s+a)-a\ds = J ds j ^ |^ b Km(s+Q )_ a - g Km ( s+a )-a I da 

= VA A" 1 / l^^gslds) = / \<j> a -g,\ds<v. 

l=1 \ J(i-1)A J J 

Hence, it suffices to show that 

/ / \g Km (s+a)-a - 9s\ dads -> 0, m^oo, 
Jo Jo 

which for smooth functions follows from the Lebesgue dominated convergence theorem, or 
even from uniform convergence under the integrals (of course, the smooth function g with a 
compact support is bounded). In turn, the existence of a smooth function g claimed above 
is due to the property that smooth functions are dense in Li[0,T]; this is, e.g., because so 
are step functions, while the indicator of any Borel set may be approximated in this function 
space by a finite set of intervals, and any indicator of an open interval can be smoothed. In 
turn, the 'standard technique' above means that one chooses next m n so that the Lebesgue 
measure of the set {a : J |v ?fc Km (s+a)-a — ¥ h s \ ds > 2~ n } does not exceed 2~ n , then finally the 
almost sure convergence is due to the Borel-Cantelli Lemma. The reasoning for the second 
integral is similar: we treat the integrand as a function of s and use the same freezing. 
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So, we can choose a value m from this subsequence (m — > oo) so that, 
firstly, A < A(u) (a value from the Lemma 5); secondly, J Q T \L(ip s , ip b s ) — 
L(ip s , Xs) \ ds < u/3, so that 



\S*(x)-S(<p)\<w, 
thirdly, J Q T \(p h s — Xs\ ds < 5'/ 10, and finally, 



{p(Xt, cp b ) < 5'} D {p(X^x) <S'x 9/10}, 



(20) 



(21) 



and (see above for 5' small enough (i.e. we can fix the value 5' 

here), 

' : ' (22) 



{p(X^, X )<S'x 9/10} D {p(X^, x) < S'}, 



for the latter we need p(tp, x) + p{<P, V') ^° be small enough which means, 
in particular, m large enough. 

10. For simplicity of presentation, we assume that a = 0; otherwise the 
discretisations below should be read (p A = (<fA-a, y^A-a, . . . , fmA-d, <fr), 
where a = a — [a/ A]; this general case is considered similarly. So, we 
assume a = 0, and denote y? A = (<^a, V^a, • • • , V^a), ^A = T. Since 
|| /|| c < oo, we have (see Freidlin and Wentzell, proof of the Lemma 
7.5.1), 

' ' (23) 



{p(X^x)<5'}D{p((X^, X A )<5"}, 



if 5" and A are small enough, 

8"<5"{5') and A<A(5') 



(24) 



(however, A < A(S") is not required!), and assuming all our curves start 
at xq at time zero (hence, we do not include the starting point into the 
definition of <p A ). Here for discretised curves we use the metric, 

P(^ A , X A ) := sup |^ fc A - x*a|- 

k 

Now, we are going to estimate from below the value in the right hand 
side of the inequality, 



P(p((X^, X A ) < 5") > E nr =1 1{\*L - **a| < 5[) 



(25) 



where 5[ < 5' 2 < ■ ■ ■ < 5' m = min(5(z/), 5"), i — 1, . . . ,m, and 5{y) is from 
the Lemma 5; here all values 8[ and certain auxiliary values Zi will be 
chosen in the next two steps as follows: 

mv H {S' k -i + z k -i) + -5' k _ x < 1 51 k d<y» & m H (5' k )<v, 

where < K < 1, and m g stands for the modulus of continuity of any 
function g with respect to all its variables restricted to |/3| < b + 1. 
Emphasize that 5" and A can be chosen arbitrarily small at this stage, in 
particular, in addition to they should satisfy the conditions of the Lemma 
5 that will be used in the sequel, that is, we require also 5" < 5{u) and 
A < A(i/). Hence, both 5" and A are fixed at this stage. 
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11. Now everything is prepared for the lower estimate. We start with the 
estimation of the conditional expectation E(I(\X'^ lA — Xm/s\ < S' m ) \ 
F( m -i)A) on the set {|X^_ 1)A -X(m-i)A| < <*m-i}> or > in the other words, 

on the set {Xf m _ l)A = Vv_i )A } with mm-i)A ~ X(m-i)A| < $' m -i- Let 
us apply the Cramer transformation of measure. Let \(3\ < b, we will 
choose this vector a bit later (as P[ip( m -i)A, X(m-i)A+}) ■ We get, 

E (l{\XtA ~ X^aI < 0*W-i)a) = (l{\X+t - XmA\ < S'Jx 

x exp (s- 2 (3(Xi A -Xf m _ 1)A ) +e- 2 AH^^ m _ u ^ |F (m _ 1)A ) , 

where E 13 is the (conditional) expectation with respect to the measure 
P 13 defined on the sigma- field F mA given F( m _i) A , by its density 



dP 13 . 
~dP 

where 



) = exp (e-' 2 (3{Xt A - Xf m _ 1)A ) - e -*AH$'@ (m - 1)A , (3) 



2 A U £ ^( 



e-tAHtftf^P) = logE (exp (V 2 /?(X r t A - Xf m _ 1)A )) |F (m _ 1)A ) . 

(We remind that ^( m -i)A = Xf m -i)A-) By virtue of the Lemma El on 
the set {\X^ m _ 1)A - X( m -i)A| < Kn-\} we estimate, 



E 



I(\ X tiA ~ XmA\ < 6' m ) | -F( m -l)A 



E? 



I(\Xi A -XmA\ < S' m 



x exp (e- 2 (3(Xl A - Xf m _ 1)A ) - e- 2 AH^^ {m _ l)A ,P)) | F (m _ 1)A 

> E? (l(\X't A - X mA\< S'J exp (S- 2 AP ((XmA - X(m-1)A)/A) 



^{H(ip {m _ 1)A ^ {m _ l)A , (3) + v) 



(S' m +S' m -i) 



\F( m -l)A 



(26) 



Let us choose (3 = (3(m) = /3[-0(m-i)A,X(m-i)A+] {= 
argmax /3 (/3x(m-i)A+ _ #("0(m-i)Aj P))}- As was explained above, 
|/?(m)| < b, moreover, 



P{m)x(m-1)A+ - H(lp {m _ 1)A ,p(m)) = L(^( m -l)A,X(m-l)A+), 



and 



X(m-1)A+ = VpH(ip( m _ 1)A ,p(m)). 
So (|2El implies (with P = P(m)), 

E (l(\Xt A - XmA \ < 5'J\F {m -i)A) 
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> exp (-e 2 A(L(^ (m „ 1)A , X(m-i)A+) +v)-e 2 (5' m + 5'^)) x 
x exp (^-e- 2 A(H(4j {m _ 1)A ,ip {rn _ 1)A ,(3) - H(ifj {m _ 1)A ,if) (m _ 1)A , 

^ /( | X ^ a _ XmA | < ^J|F (m _ 1)A ) 
> exp (-£- 2 A (L(ip (m _ 1)A , X(m-i)A+) + 2i/) - £~ 2 (c^ + <4_i)) x 

( /( |^ A - XmA | < ^ m )|F (m _ 1)A ) . (27) 

We have used the uniform continuity of H(x,-,(3) over \/3\ < b and 
x e R d : 

|#(^(m-l)A,$(m-l)A,/?) ~ #0(m-l)A, ^(m-l)A, /?) I 
< TOff(|^(m-l)A - ^(m-l)Al) < "MC-l) < 27 

(here stands for the modulus of continuity of H for \f3\ < b with b 
fixed), as 8' m _ x is small enough. 

12. Let us show the bound 

E p {m) (jflj^ - XmA\ < O^W)a) > 1 - exp(-C m A £ - 2 ) (28) 

with some C m > which may depend on u, on the set {\ip( m -i)A — 
X(to--i)a| < ^m-i}; ^ £ 1S small enough. There exists a finite number of 
vectors v x , v 2 , ■ ■ ■ , Ujv such that ll^fcll — 1 V/c, N = 2d (any orthonormal 
basis would do accomplished by its "symmetric" transformation, i.e. with 
each coordinate vector v we consider — 1> as well), and 

^M(/(|^ A - XmA |>^)|F (m _ 1)A ) 

< ££Ll ^ M (/((^A - *(t- 1)A - XmA + X(m-1)A)«* 

> - 8' m _ x ) | F (m _i )A ), 

given {\X^ m _ 1)A - X(m-i)A| < ^m-iK where k = (2/iV) 1 / 2 (notice that 
K < 1). Given any z/^_ x > (a new constant which has nothing to do 
with v and will be fixed shortly; we need it so to say only locally, while 
establishing the inequality (|2~8*j0. we estimate, for any v := Vk, < z < 1, 

EP{m) (/((^ A - Xf m __ 1)A - X mA + X(m-1)A)V > <8' m - 8> m _ x )) |F (m - 1)A ) 

< exp(-(8' m - 8' m _ x )AzKe- 2 ) exp (Ae- 2 [-^X( m -i )A + 
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+F £ ^ (m _ 1)A ,/3(m)+^^ 

< exp(-(5' m - S'^JAzks- 2 ) exp (As- 2 [-zvx( m ~i)A+ 

+F(-0( m -l)A, lp(m-l)A, f3{m) + Vz) 
-F(VV-l)A,'0(m-l)A,/3("'O) +2^^_ X ]J , 

if e is small enough. Denote 

-[5(^( m _l) A ,)/S( m _l) A ,/3(m) + f z) - i^(^(m-l)A, ^(m-l)A, 

We have, h(0) = 0. Moreover, 

^'(0) = - + X(m-l)A+V ~ V^if (^(m-l) A , ^(m-l)A, /9(m))v 

= (<^m - ^m-l) K + V^if(^(m-1)A, ^(m-l)A, f3(m))v 

-VpH(ip (m _ 1)A , ^( m _i)A, (3{m))v 

(where my^ stands for the modulus of continuity of the function V pH 
given < b + 1 {b + 1 is needed for the sequel, although here b 

would be enough)), because X(m-i)A+ = Vpiif (t/>(m-i)A, ^(m-i)A, /3(m)). 
The latter inequality, C m _i = (£„ — — mvfl(^m-i) > 0, holds true 

provided &' m _\ is small enough in compare to (5' m — ), e.g., 

or, equivalent ly, 

m V H(^_i) + - C-i < 2 ( 30 ) 

Moreover, since V pH is bounded and continuous due to the Lemma 01 
then h'(z) > C m __i/2 for small z, say, for < z < z m -\ (thus, z m _\ has 
been chosen here). Indeed, 

h\z) = (5' m - S'^-^K + X(m-1)A+V 
-V/3iJ(^( m „i)A,^( m -l)A,/5M +VZ)V 
= (Kn ~ S 'm-l) K + V /3 if(V'(m-l)A, ^(m-l)A, f3(m))v 
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-V^-H"(^(m-1)A, ^(m-l)A, + V z)u 

> ( 5 m - - ™Vff(<4-l + *)• 

So, h(z m -i) > C m -iz m _ 1 /2, provided z m _\ along with ft m _ x are small in 
compare to (S' m - 8' m _ x ), e.g., 

m VH {5' m _ 1 + Zm-j) < {5' m - S' m _ 1 )K/2, (31) 

rather than (J3DJ). Hence, under assumption of (|3*T|). the r.h.s. in (|2*9|l 
with 2! = z m does not exceed the value 

exp(A £ - 2 (2z/^_ 1 - < exp(~C m ^Az m ^e- 2 /4}) 

if we choose 

^' m _i < C m _i2 m _i/8. (32) 

Remind that the constant z^-i should be fixed in the beginning of this 
step of the proof; hence, we can do it now, once we have chosen z m _i, 
since the latter does not require any knowledge of u' m _ v This gives the 
bound, given {\Xf m _ 1)A - X(m-i)A| < $' m -i}i 

E p(m) (i(\x+i - XmA\ > ^)|F (m _ 1)A ) < exp(-C m -iA £ - 2 /4), 

which is equivalent to ([28)1 . In turn, (|28|l implies the estimate 

P (\ X mA _ XmA\ < <%jF( m _i) A ) 

> exp (-£- 2 A(L(^ (m _ 1)A , X(m-i)A+) + 3i/) - £ _2 (5^ + tf^-i)) » 
if e is small enough. Indeed, z/ being fixed, one can choose e so that 
1 -exp(-C m _iA£:~ 2 ) > exp(-l) > exp(-z/(Ae~ 2 )). 

13. By "backward" induction from k = m to k = 1, choosing at each step 
8' k _ x and Zk-i small enough in compare to S' k — S k _ v 

mvir(Ci + + f d < 1 5 'k, & d < **/2, & < " 

(33) 

(cf. (JSIJ), as well as all auxiliary values Cfc_i, we get the for e small 
enough the desired lower bound: 

P(\xi m -^ Am \<s' m ,...,\xi- ipA \<s[) 

> exp (-E- 2 A E^l(^(^(m-i)A, X(m-i)A+) + 3l/) - 2£- 2 

= exp (-£- 2 ( J T L(^ s , *,) rfs + 3z/T) - Ae- 2 5' m ) 
>exp(-£- 2 (5oT(^) + ^(3T + 2))), £ ^0, 
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provided 4S' m < u, and due to ([2U|h This is equivalent to (JSJ). This bound 
is uniform in x G E d , \y\ < r, and <p G for any r, s > (similar to 

the Lemma 7.4.1 from Freidlin and Wentzell (1984)). 

For the reader's convenience we remind the order of all choices by re- 
peating the diagram in the third step of the proof. The (any) constants 
8 > and v > are fixed; due to the Lemma 5, we have also A(u) and 
5(v)\ all other constants are chosen consequently as follows: 

b & 8' i— > 8' I— > A & 5" i— > S' m I— > i— > z m _i i — ► i/ ^ n _ 1 . . . i— > ^ i— ► i— > v [] 

one may say that all these values have been constructed via v. Some 
other constants (namely, CVs) are constructed via (%,'s; the value m is 
defined as soon as A is chosen, as A = T/m; and eventually e should be 
small enough to ensure all our asymptotic inequalities. 

14. The property of the rate function S to be a "good rate function" can be 
shown as in Freidlin and Wentzell (1984), using the semi-continuity of 
the functionL(x, y) w.r.t. y and continuity w.r.t. x variable (see Lemma 
7.4.2 from Freidlin and Wentzell). 

15. Second part of the proof: the upper bound. Assume that the asser- 
tion is not true, that is, there exist s and v > with the following 
properties: 

V5 > 0, there exists 5o < 5, We, there exists e < e : 

P(p(X, > 6 Q ) > exp(-e- 2 (s - u)). 

In the other words, for some (hence, actually, for any) So > arbitrarily 
close to zero, there exists a sequence e n — >• such that 

P(p(X, $ x (s)) > So) > exp(-e- 2 ( S - u)). (34) 

We fix any such So > 0. 

16. Since / is bounded, all possible paths for any ip belong to some 
compact F C C[0, T; R d ]. Due to semi-continuity of the functional S^(<p) 
w.r.t. if), for any v > there exists a value S v ((p) > such that p(ip, if)) < 
S v ((p) and S(if) > s imply S^((p) > s — v/2. 

Since S^(ip) is lower semi-continuous w.r.t. <p , too, it follows that 8„(ip) 
is also lower semi-continuous w.r.t. ip. Hence, it attains its minimum on 
any compact. 

Consider F\ = the compact obtained from F by dropping the 8q/2- 
neighbourhood of the set $ x (s) = {<p G C[0,T; R d ] : ip = x, S(ip) < s}. 
Denote 8 U = inf^g^ 5 u ((p), and take any 8' < min (8 V /(AKT + 2), So/2) 
where K is a Lipschitz constant of /. 

Choose a 8' -net in F\, let tp , . . . , <p be its elements. All of them do not 
belong to $ x (s), hence, S(ip % ) > s' > s. 
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17. Note that 

N 



{p(X,$ x (s)) > 5 } c \J{p(X,^) < 5'}. 



i=i 

Then, by the Dirichlet principle, for any n there exists an index 4 i such 
that 

P(p(X, p l ) < 5') > N- 1 exp(-e~ 2 (s - u)). (35) 

There is a finite number of % = 1, . . . , N. Thus, there exists at least 
one i such that (|35|) holds true for this i for some subsequence n' — > oo 
and correspondingly e n i — > 0; however, we will keep a notation n for 
simplicity. 

We can rewrite (|33|) as 

P(p(X, ^) < 5') > ex V (-s- 2 (s - i/)), (36) 

since iV does not depend on e n , strictly speaking with a new z/ > 0; 
however, it is convenient to keep the same notation. Denote (p l =: y 3 ^')- 

18. Consider a sequence 5 f — » such that a corresponding function 

does exist for any 5' from this sequence. Recall that 5q is fixed. All these 
functions satisfy the inequality 

%>(<?')) > s ' > s > 
since p(v? J , $ x (s)) > 5o/2, and also 

S(<p($')) < oo, 

which implies 

sup \(pt(S')\ < C. 
t 

Due to the Arcela-Ascoli Theorem, it is possible to extract from this set 
of functions a subsequence which converges in C[0,T; R d ] to some limit, 
(p. Since p((p(8'),$ x (s)) > 5q/2, we have, p((p,$ x (s)) > So/2, hence, 

S(<p) > s, 

and, in particular, the lower bound (0) can be applied. However, due to 
the construction, the function (p satisfies one more lower bound, 

lim inf lim sup e 2 In P(p(X, tp) < 5') > -s + v. (37) 



4 Indeed, otherwise for any i, 

P(p(X, tf) < 5') < N- 1 exp(-e- 2 ( s - i/)), 

and 

P(p{X,* x (a)) >5 )<P {(jipiX^) < 5'}) <eM-e- 2 (s -v)), 



\i=l 



which does contradict 1)34(1 . 
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Indeed, the latter follows from (J55J) because, e.g., 

P (p(X, ^)<6' + p{y, y{5'))) > P(p(X, <pffl) < 6') > exp(-^ 2 ( S -z,)). 

Due to (jHZj), there exists 5' > such that for smaller <5"s (a sequence) 
lim sup e 2 In P(p(X, tp) < 5') >-s + u/2. 

In fact, this implies the same inequality for any 5' > 0, because with 
any 5' for which the inequality holds true, each greater value would do 
as well. Therefore, for any 5', there exists e > (arbitrarily small) such 
that 

e 2 In P(p(X, tp) < 5 f ) >-s + u/3 = -(s - v/3). (38) 
We are going to show that this leads to a contradiction. 

19. Consider the case S(tp) < oo. Remind that S(tp) > s. Denote 

L b (x,y) = sup (f3y - H(x, (3)), 

£ b {x,y) :=L(x,y)-L b (x,y) 

= sup(f3y - H(x,P)) - sup(py - H{x,(3)). 

P |/3|<6 

Consider the function £ b (<p t ,<p t ). We have, 

a <L{<p t ,cp t ). 

Moreover, 

and the function I is decreasing with b — > oo. Hence, given v > 0, one 
can choose a b > such that 

/ i\tp u (p t )dt< I//20. 

Notice that we have chosen b. Moreover, one can also choose a discreti- 
sation step A (see above, item of the proof) such that 

T 

e b (<p Km (t+a)-a,fi Km{ t+ a )-a) dt < ^A ' 

and, correspondingly, 

/ L b (tp Km{t+a y a , $ Km{t+a) _ a ) dt> s- i//10; (39) 
Jo 

again assume for simplicity of presentation that a = 0. In addition, we 
require A < A(z//20) (this notation is from the Lemma |3j). Hence, we 
have chosen A and m = T/A. 
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20. So, with a = 0, let 

A ■= P^t), Xt ■= Km (t), Xo = x. 

We have, with a unique C = 2(KT + 1) (see (fT7jl) and for any 5', 

P(p(X,f>) < 6') < P{p(X*,x) < C6') < P(p(X^ A , x A ) < C8'). 

Denote 5" = CS'. Let us choose 5" < 5 (u/20) (the notation from the 
Lemma 5 is used), and consider the following inequality, with the se- 
quence (51, 1 < i < m), 5' m = 5", constructed via the value u/20 instead 
of v (compare to (|33|) . where we can now drop the requirement related 
to mvff), 

m 

P(p(X,(p) < 6[) < E]Jl(\X^ A - X A \ < 

i=l 

In particular, we require 45" = A5' m < u/20, and YlT=i^'i — ■ Then, 
due to the Lemma 5 and using the calculus as in the steps (jliH12|) , - the 
only change is that now we need an upper bound, and an estimation of 
the indicator function can be skipped, - we get on the set {\X^ m _^ A — 

X(m-i)A| < b'm-i} and for an y \P\ < b , 

^(Ai^A-XmA|<^)|F (ni _ 1)A ) 

< E 13 [l(\Xi A -XmA\< S'J exp (s- 2 Af3 (( XmA - X(m-i)A)/A) 

-£- 2 A(if(V( m -l)A,^(m-l)A,/3) - u/20) + ^h 1 ) |F (m _ 1)A ) (40) 

(compare to (j^Bjl) . Estimate here I(\X% lA — XmA\ < S' m ) < 1 and drop 
the expectation sign, then on the set {\Xf_ x ~> A — X(m-i)A| < ^m-i}> f° r 
any \j3\ < b, 

E(l(\Xt A - X mA\<6' m )\F im - 1)A ) 

< exp (-E~ 2 Af3 ((x m A - X(m-i)A)/A) 
-e- 2 A(H(ij {m _ 1)A , Vv-da,/?)) + , 

< exp (-e- 2 A(3 ((x m A - X(m-i)A)/A) 

-S~ 2 A(H(^ {m _ l)A , V(m-1)A, P) - u/20) + , (41) 

once we have chosen m#(<5") < u/20 (remind that m# is the modulus of 
continuity of the function H on the set \/3\ < b), because of the inequality 

\lp(m-l)A - ^(m-l)A| < 5' m -l < S " ■ 
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Let (3 satisfy a condition, 

P(XmA - X(m-1)A)/A - H (V>(m-1)A, ^(m-l)A, /?) 
= SU P|/3|<6 (/5(XmA - X(m-1)A)/A - H(lp {m _ 1)A , ^( m -l)A, /?)) 
= ^ 6 (V'(m-l)A)X(m-l)A+)- 

Then, on the set {|X ( ^ 1)A - X(m-i)A| < V m -\}, 

E(l(\XtA-XmA\<5>J\F (m - 1)A ) 

< exp (- £ - 2 A(L fc (^ (m _ 1)A , X(m-i)A+) + ^ 2 A|? + • (42) 

Similarly, using induction and due to (jHHJ), we get 

P(p(X, X )<S l 1 ) 

< exp (s- 2 jL b ^ Kmit) , ^ Km(t) ) dt + e- 2 u/20 + Ae~X 
V o 

< exp (-e- 2 (s - i//5)) . (43) 
This evidently contradicts ()38j) where 5' may be arbitrarily small. 

21. Consider the case <p absolute continuous, and S((p) = oo. In this case, 
due to monotone convergence L b — > L, there exist 6 > 0, m and a e [0, T] 
such that 

/ L 6 (^t, t ) dt > s-v/20, f L\^ Km{t+a) _ a , <p Km( t +a y a ) dt > s-v/10. 
Jo Jo 

The rest is similar to the main case, S((p) < oo, and leads again to 

P(p(X, <p) < 5[) < exp (s- 2 (s - u/5)) . 
This contradicts (J3~8j) . 

22. Consider the last possible case, <p not absolute continuous. In this case, 
for any constant c, in particular, for c = ||/||c + 1, there exist two values 
< t\ < t 2 < T, such that \<p t2 — <p tl \ > c(t 2 — ti); indeed, otherwise 
(p must be Lipschitz with \(p\ < c. Therefore, for 5 < (t 2 — ti)/2, the 
probability P(p(X, <p) < 5) necessarily equals zero, because the event 
{p(X, 0) < 5} is empty. This evidently contradicts (J38j) . In all possible 
cases, we got to contradictions. Hence, the assumption is wrong, that is, 
the upper bound (J3J) holds true. The Theorem is proved. 
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APPENDIX 



A. Comments on the Lemma [HJ To explain that the LemmalHlis valid 
without additional assumptions, we have to review very briefly its proof and 
show those assumptions. 

Let = to < < • • • < t m = T be a partition, jk{P) '■= f t * 1 H{<p 8 , /3)ds, 
£k(o>) = sup /3 (a/5 — 7fc(/3)), Ak = {a : £k(a) < oo}, A° k its interior w.r.t. the 
linear hull L^ fe . 

The inequality S(ip) = J L(<p t , (p t )dt < oo implies 

m m 

sup ((<p tk - ift^J - 7fc(/3)) = ^4(v>t* -^_J < S'(^). 
fe=l fc=l 

Under additional assumption A° k ^ it is proved in Freidlin and Wentzell 
(1984) using the arguments from Rockafellar (1970) that for any v > 0, there 
exists a function <^ such that p(<p, (p) < v and there exist (5k such that 

tk(<Pt h ~ & k _J = - <Pt k .j)Pk ~ lk{Pk) (44) 

and 

^*-^i = V7*(A)- (45) 

The proof goes well if A£ 7^ V7c. 

Let us show that the same is true if A° k = for some fc's. The property 
A° k = is equivalent to dimL^ fc = 0. In this case, 7t(/3) = with some 
Cfc G Hence, £k( a k) < 00 means that £k(ock) = and for any other a, 
£k{a) = +00 and 7fc(/3) = So, we have 

- yt fc _i) = = - ^t fc _i)/3 - 7*(/?) 
for any /3. Let = 0. Evidently, 

<Pt h -Vtk-i = V7 fc (/3 fc ). 

Hence, in the case A£ = 0, one should not just change the curve ip s on the 
interval (tk-i,tk)\ that is, (|44j) and (|45|) are valid in this case also. 

The rest of the proof is not changed. For any step function (, one defines 
a piecewise linear \ by the formula 

Xo = <Ab Xs = VpHiCs, (3 k )), tk-i < s <t k , k = 1, 2, . . . , m. 

Then it is shown that ( n — > <p implies \ n — > <p due to the property that the 
convergence of smooth convex functions to the limit implies the convergence 
of their gradients. Then there exists a partition such that this construction 
gives one 

/ L(( t , Xt )dt<S(tp) + iy. 
Jo 

So, the lemma holds true without additional assumptions. The assertions 
about ( and (3 S can be shown similarly. 
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B. Comments on the property A° k ^ 0, and characterization of the 

set C°[f,x}. Denote the interior of A(x) = {a : L(x,a) < 00} with respect to 
its linear hull L A{x) by A°(x). Then A° k = <t=^ A^ipt^J = 0. In this section 
we show the following equivalence: 

card(f G R d : f = f(x,y),y e M) = 1 &imL A{x) = = 0. 

Since -A(x) is convex, clearly the first two conditions are equivalent. 

If {f(x, •)} contains only one point then H(x,f3) is linear w.r.t. [3; hence, 
A(x) consists of a unique point and A°(x) = 0. 

Now, let {f(x, •)} contain at least two different points, say, f(x : y±) 7^ 
f(x,y 2 ). Then there exists 1 < k < d such that (f(x,yi) — f(x,y 2 )) k 7^ 0. 
Denote M k = sup y f k (x,y), m k = inf y f k (x,y). Let < v < (f(x,y 1 ) - 
f(x, 2/2))fc/2. Take two points y' and y" such that f k (x,y') < m k + u/5 and 
f k (x,y") > M k — v/h. There exist two open sets B' C M and B" C M such 
that sup^, f k (x, y) < m k + v/A and infj, eS « f k (x, y) > M k - vjA. 

Since the process yf is a nondegenerate ergodic diffusion, there exists A > 
such that 

P(y x s G B', l<s<t)> A' -1 , G 5", 1 < s < t) > A* -1 , * -> 00. 

Let P = z(3 k where (3 k G E d is a fcth unit coordinate vector and z E R. 
Then for 2 > we have, 

z'H' 1 log Eexp(z/3 k [ f{x,y x s )ds) 
Jo 

> z-H' 1 log EeMzfa f f(x,y x s )ds)I(y x s G B" , 1 < s < t) 

Jo 

> z'H- 1 log{exp(^(M fc - u/2)t)X t - 1 } 

= M k - v/A + 1 --^z- 1 log A > t —^-M k - u/2, 

if z is large enough. In the other words, for large positive z one has H (x, z/3 k ) > 
z(M k — 2v). Similarly, for large negative z 

\z\-H~ 1 log Eex.p(zf3 k [ f{x,y x )ds) 

Jo 

> \z\~H- 1 log EeMzfa [ f(x,y x s )ds)I(y x s G B", 1 < s < t) 

Jo 

> |^|" 1 t" 1 log{exp(2(m fc + i//4)i)A* -1 } 

t — 1 t — 1 

= -(m k + v/A) H — 1^| _1 log A > — m k - v/2, 

\t\z\ is large enough. In other words, for negative z with large absolute values 
one has H(x,zj3 k ) > z{m k + v). Therefore, {a : a — j3 k 9, m k + v < 9 < 
M k -v} C A(x). 
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On the other hand, it is obvious that if a = (3^9, 6 G R 1 , with 6 > M k or 
6 < m k , then L(x,a) = oo, because m k z < H{x,(3 k z) < M k z, and, hence (say, 
if 9 > M k ), for z » 1, 

(3 k 9(3 k z - H(x, (3 k z) > (0 - M k )z -> +oo, z -> +oo. 

A similar calculus and inequalities are valid for any unit vector f3 . This 
shows, in particular, that dimL^(x) = dimLj(x), and, moreover, that 
La{x) = Lf(x). Since A(x) is convex, it shows also that the interior A°(x) 
w.r.t. Lm x ) is not empty, except only the case dim(L^( a ,)) = 1. Hence, the 
third condition is equivalent to the second one and the first. 

So, the condition A° k ^ is always satisfied if the set {f(x, •)} for any x 
consists of more than one point. In fact, if card{f(x, •)} = 1 for any x then / 
does not depend on y. In this case, one has nothing to average. 

Notice that our considerations above provide the following description of 
the set C°[f,x}: 

C°[f,x] = {ae R d : m p (x) < (a, (3) < M p (x), V|/3| = 1, with m p (x) < M p (x), 
and (a, (3) = M p (x), Vj/3j = 1, with m (x) = M (x)}, 

where mp(x) := ini(4t,f(x,y)), M p (x) := sup( A, f(x, y)). Moreover, it can 

V y 

be shown similarly that for any x, x (although we do not need it here), 

C[f,x,x] = C[f,x}. 



C. About a s G C°[f,(p s ]. Let x = tp s , a = a[x,x] as described in the 
proof of the theorem 1. If we show that for any direction v (a unit vector) 
satisfying the property m v < M v , the strict double inequality holds true 

m v < dH(x, zv)/dz\ z=0 < M v , 

z E R 1 , then it would follow a s E C°[f,(p s ]. Let v > and again two 
open sets B' and B" be chosen such that sup ygB / v f(x, y) < m v + v jl, 
and mlyfzB" vf(x, y) > M v — vjl. Let fii nv (B") be invariant measure for 
the event {yf G B"}. We can choose v and correspondingly B" so that 
fj,inv(B") < 1. Then, due to large deviation asymptotics for the process yf, for 
any fi inv (B") < ( < 1 there exists A > such that 

P (V 1 jf l(y x s G B") ds>^j< exp(-Af), * > * c . 

Denote A c = {t" 1 G 5") rfs < (}, ^ = jt" 1 jj G 5") rf S > (}, 

then for z > 0, 

Eexp(zv [ f(x,y x )ds) 
Jo 
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< Eexp(z [ {M v l{y x G B") + (M v - v)l{y x <£ B")) ds) 1(A$ 
Jo 

+Eexp(z f (M v l(y x G B") + (M v - v)l{y x # B")) ds) 1(A C ) 
Jo 

< Eexp(ztM v + zt(M v - v)) 1(A C ( ) + Eexp(ztM v ( + zt(M v - v)) l(A c ) 
< exp(ztM v + zt(M v -u)- zXt/z) + exp^tM^C + zt(M v - v)), 

hence, 

limsup limsup (tz^E exp(zv / f(x,y x )ds)<M v . 

z-^O t^oo Jo 

Similarly, using B' one can get 

lim inf lim inf {tz)' 1 E exp(zv / f(x,y x )ds) > m v . 

Thus, 

m v < dH(x, zv)/dz\ z=Q < M v . 

Therefore, a G C°[x, /]. 
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